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ABSTRACT 

Supermassive black hole binary systems (SMBHB) are standard sirens - the gravita- 
tional wave analogue of standard candles - and if discovered by gravitational wave 
detectors, they could be used as precise distance indicators. Unfortunately, gravita- 
tional lensing will randomly magnify SMBHB signals, seriously degrading any distance 
measurements. Using a weak lensing map of the SMBHB line of sight, we can estimate 
its magnification and thereby remove some uncertainty in its distance, a procedure 
we call "delensing." We find that delensing is significantly improved when galaxy 
shears are combined with flexion measurements, which reduce small-scale noise in re- 
constructed magnification maps. Under a Gaussian approximation, we estimate that 
delensing with a 2D mosaic image from an Extremely Large Telescope (ELT) could re- 
duce distance errors by about 30-40% for a SMBHB at z = 2. Including an additional 
wide shear map from a space survey telescope could reduce distance errors by 50%. 
Such improvement would make SMBHBs considerably more valuable as cosmological 
distance probes or as a fully independent check on existing probes. 
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1 INTRODUCTION 

Recently there has been growing interest in the potential 
future use of coalescing compact binary systems as high- 
precision cosmological distance indicators. During their in- 
spiral phase, prior to coalescence, the amplitude and fre- 
quency of the gravitational wave emission from these sys- 
tems varies rapidly with time, following a characteristic 
'chirp' waveform that is strongly dependent on the binary 



In a seminal paper, Schutz (|l986l ) showed how measure- 
ment of the amplitude, frequency and frequency derivative 
of the inspiralling binary, using observations carried out by 
a network of interferometric gravitational wave detectors, 
could yield a precise estimate of its luminosity distance - 
completely independent of the traditional cosmic distance 
ladder. More recently these binary systems have been la- 
belled 'gravitational wave standard sirens' - by analogy with 
electromagnetic standard candles (although, strictly, they do 
not require the assumption that all sources have the same 
gravitational wave luminosity). 

Siren candidates include neutron star-neutron star bi- 
naries, belie ved to be associated with short-duration gamma 
ray bursts (|Eichler et al.|[l989l ). These systems are among 
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the prime targets for detection by the next generation 
of ground-based interferometers, Advanced LIGO and Ad- 
vanced VIRGO, which should de tect up to a few doze n 
such bin ary coalescences per ye ar (|Kopparapu et al]|2008l ). 
Recently Nissanke et all (|2009l ). extending earlier work by 
iDalal et alj (|2006t ) . have shown that the advanced detectors 
will be able to determine the luminosity distance of these 
sirens to an accuracy of better than 30%, out to a distance 
of 600 Mpc. 

Even more promising siren candidates are the merg- 
ers of supermassive black hole binaries (SMBHBs). These 
systems are expected to be extremely luminous gravita- 
tional wave sources and will be prime observational tar- 
gets fo r the planned Lase r Interferometer Space Antenna 
(LISA; iBender et aI1ll994f) . Moreover, the issues addressed 
in this paper are of particular importance for SMBHBs; con- 
sequently they will be the principal focus in what follows. 

The expected number and redshift distribution of SMB- 
HBs that will be observed by LISA is very uncertain, be- 
ing strongly dependent on the details of our model for the 
histo ry of galaxy mergers (|Sesana et al.l 120071 ; lArun et al.l 
120091 ). However, the impact of observing even a handful of 
SMBHBs could be ver y significant, as first demonstrated by 
iHolz fc Hughes! (|2005l . hereafter HH05). Those authors per- 
formed a Monte Carlo study to calculate the fractional accu- 
racy with which luminosity distance can be determined for 
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sirens of a range of masses and at different reshifts, randomly 
distributed in orientiation and sky position. Assuming that 
the sky position of the SMBHB could be determined ex- 
actly from electromagnetic observations, HH05 found that, 
for e.g. the merger of a 10 5 solar mass and 6 x 10 solar 
mass black hole at z = 1, the most probable fractional er- 
ror on luminosity distance was 0.1%. Moreover, the same 
binary system observed at z — 3 would yield a most proba- 
ble fractional error of 0.5%. HH05 then demonstrated the 
dramatic cosmological potential of these high-precision dis- 
tance indicators: only two sirens observed at z = 1 and z — 3 
could constrain cosmological parameters at a level compet- 
itive with 3000 type la supernovae. 

As pointed out by HH05, however, there is a huge 
caveat: standard sirens (like standard candles) will be 
(de-)magnified by weak lensing caused by matter fluctua- 
tions along their line of sight. The observed luminosity dis- 
tance to a siren is related to its true luminosity distance 
by 



D? S = D L M - 1/2 kD l (1-6h/2) 



(1) 



where fj, is the lensing magnification and Sfi = fi — 1 = for 
an unlensed source. The scatter in 8fi is expected to be sev- 
eral percent for high-redshift sirens; this adds in quadrature 
to the sirens' intrinsic scatter, substantially degrading their 
effectiveness as cosm ological probes. Moreover, as shown by 
iKocsis et al.l l |2006t ). weak lensing will also significantly in- 
crease the extent in redshift of the 3-D error box determined 
by LISA - thus rendering much more difficult the task of 
identifying an electromagnetic counterpart. 

Subsequent work on SMBHBs has mainly focussed on 
better localising an d understanding their electromagnetic 
signa tures (see e.g. IKocsis fc Loebl Bool lLang fc Hughes! 
120061 ) and further reducing their i n trinsic scatter in the 
absence of lensing. Lang fc Hughes! ((2006) included spin- 
induced precession (|Vecchidl2004 ) in their model of the siren 
waveform and showed that this significantly reduced the me- 
dian luminosity distance error, and angular size of the LISA 
error box, because the precession terms break the degen- 
eracies between other waveform parameters. Other authors 
(see e.g. lPorter fc Cornishl20o3 ) have found similar improve- 
ments through the inclusion of higher-order har monics in the 
siren waveform, and recently I Arun et al.l (|2009l ) have consid- 
ered both precession and higher harmonics in their modelling 
of SMBHBs, with impressive results. 

Notwithstanding this progress, however, the damaging 
impact of weak lensing on the performance of sirens has 
remained larg ely unaddresse d - alt hough a notable excep- 
tion has been iJonsson et all (|2007T ). which built on earlier 
work exploring weak lensing co rrections to standard can- 
dles (|Jonsson et al.l [20051 . l2006al fbl). Their method proposes 
to model, and thus correct for, lensing magnification by us- 
ing the observed photometric and spectroscopic properties 
of foreground galaxies along the line of sight to the siren. 
The authors consider the case where the lensing signal is 
dominated by a single dark matter halo; the magnification 
factor is computed from an inferred halo mass profile, which 
is in turn constrained by photometric and spectroscopic data 
using the Tully-Fisher or Faber-Jackson relations. The au- 
thors' results are quite impressive, showing that the disper- 
sion due to lensing for a standard candle or s iren at z = 1.5 
can be reduced by about a factor of two (jjonsson et al.l 



l2009h . However, their method does involve a number of spe- 
cific modelling assumptions about halo mass profiles and the 
observable mass proxy. 

In this paper we pursue an alternative approach which 
makes fewer modeling assumptions: we propose calculating 
weak lensing corrections derived from gravitational lensing 
shear and flexion maps constructed in the direction of the 
SMBHB. In other words, we directly measure the siren mag- 
nification in order t o rem ove it, a procedure we call "delens- 
ing". Dalai et al. (|2003l . hereafter D03) have shown that 
performing such corrections for large surveys of type la su- 
pernovae (SNela) is unlikely to be worthwhile. Although 
SNela surveys conveniently provide galaxy shape measure- 
ments that could be used to delens each supernova, they will 
not have galaxy densities high enough to resolve small-scale 
contributions to the magnification. 

We find at least three reasons to reconsider delensing in 
the case of SMBHBs: 

• Since we expect SMBHB detections to be rare, we can 
furnish each one with a very deep pointed follow-up obser- 
vation, thus obtaining the high galaxy densities needed for 
high-resolution shape maps. 

• Galaxy images from devoted observations could enable 
higher quality shear and flexion measurements than those 
available from a survey telescope. We will show that high- 
quality flexion measurements are a substantial advantage. 

• Delensing will make more of an impact on SMBHB dis- 
tance measurements than on SNela measurements. Lensing 
will be by far the largest source of scatter in SMBHB dis- 
tances, whereas SNela retain a significant intrinsic scatter 
even after delensing. In addition, since surveys will detect a 
very large number of SNela, the magnification noise will be 
partially averaged away. 

In this paper we investigate the efficacy of delensing 
SMBHBs with high-quality shear and flexion maps. In 
we briefly review the Kaiser-Squires technique for recon- 
structing a magnification map from shape measurements. 
In 331 we present our methodology for estimating how pre- 
cisely a SMBHB magnification can be measured from a set of 
shape maps. We also demonstrate the advantage of combin- 
ing shear and flexion measurements to substantially reduce 
shape noise. In i|4] we compute the reduction in the scatter 
in SMBHB distances that could be achieved by delensing 
with an Extremely Large Telescope (ELT) . We further show 
that delensing can be improved by combining ELT maps 
with a wide space telescope survey. We conclude in [JS] along 
with a discussion of topics for further investigation. 



2 MAPMAKING 

In order to improve our estimate of the luminosity distance 
to the siren, Dl, we need to obtain an accurate estimate 
of the lensing magnification /x(x) at the siren's position x 
on the sky. If the siren signal has undergone weak lensing, 
this magnification is related to the lensing convergence k 
by jj, ~ 1 + 2k. So if we can make an accurate convergence 
map at and around position x, we have an estimate of the 
magnification in that region. 

Convergence maps can be constructed from shear 
measurements using the Kaiser-Squires (KS) inversion 
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meth od (|Kaiser fc Squired 1 19931 ; iBartelmann fc Schneider! 
2001). Since shear and convergence are both combinations 
of derivatives of the lensing potential, the convergence can 
be estimated from the shear by averaging shear estimators 
in pixels on a grid, Fourier transforming to obtain shear es- 
timators and finding Fourier-transformed convergence 
R according to 



k(l) 



,2 ,2 



(2) 



This can then be inverse Fourier transformed to obtain a k 
map, and hence fj, map, including the value at x. 

Higher order shape distortio ns can similarly be inverted 
to create a k map. Bacon et al. (|2006l ) showed that flexion 
measurements, characterizing the slight arc of images, can 
be inverted in a fashion similar to Kaiser-Squires, with the 
prescription 
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(3) 
(4) 



where T is the 1-flexion, representing a skew to the galaxy 
shape, while Q is the 3-flexion, representing the trefoil com- 
ponent of the galaxy. 

If we had perfect knowledge of the 7, T and/or Q field on 
all scales, and at the redshift of the SMBHB, this approach 
would yield a perfect map for /j, and hence a perfect correc- 
tion for Dl - However in reality our measurements of lensing 
distortions are noisy, exist only on lines of sight which con- 
tain a galaxy, cover a range of redshifts, and are for a limited 
patch of sky, so our fi map will be imperfect. 

Firstly there is the impact of noisy shape measurements. 
We cannot measure the true shear or flexion of a galaxy; 
rather, we measure an estimator for these quantities, includ- 
ing the noise due to the galaxy's intrinsic shape (ellipticity, 
skew etc), background noise from the sky and the camera, 
and noise due to an imperfect deconvolution of the PSF. 
This means that each galaxy's estimate for shear or flexion 
has a net error denoted by <j 7 , ajr and ag respectively; this 
includes the dispersion from all the above effects. Then the 
error on the signal in a pixel on our shear/flexion map will 
be a I -^/rigai^lpix, where a is the galaxy-by-galaxy error in 
the relevant distortion, n ga i is the number density of galax- 
ies and Api x is the area of the pixel. In i !3.2l we will see how 
this translates to an error on k in Fourier space. 

We note that the finite number density of galaxies acts 
as a hard limit on the scales we can probe for /1; if there are 
density fluctuations on scales smaller than the inter-galaxy 
spacing, we will not be able to estimate their lensing effect. 
As shown by D03, this is a limiting factor when delensing 
SNela since Cold Dark Matter models predict a significant 
level of sub-arcminute fluctuations in (x. On a related note, 
we will have to smooth our maps on small scales to overcome 
the intrinsic shape noise; again, this will restrict the scales 
on which we can describe /x. 

In addition to these small-scale inaccuracies, a realistic 
survey will have a finite size on the sky. This means that 
large-scale density modes, which again contribute to the to- 
tal n, will not be estimated by our survey. A related issue is 
the mass-sheet degeneracy; k estimated from Kaiser-Squires 
inversion is degenerate with (1 — (3)n+f3 for a constant f3. For 



large fields /3 — > 0, but small surveys need to take account 
of this effect. Our method for estimating errors in p must 
therefore deal with small scale smoothing and large-scale 
cut-offs in our data. In addition, the inversion equations es- 
timating k from shear and flexion should be combined in an 
optimal fashion, as we explain in i )3.2l 

Already it can be seen that ideally we require an am- 
bitious lensing follow-up programme for sirens. This will in- 
clude observations of the immediate region of the siren, with 
the best possible resolution (to minimise the <rs) and depth 
(to maximize n ga i); this will allow us to probe the small-scale 
contributions to ft. In addition, we preferably require a large 
survey (still with considerable image resolution and depth) 
around the siren position, to probe the large-scale contribu- 
tions to fi. These requirements could be fulfilled by using an 
Extre mely Large Telescope (|ELT Science Working Group! 

with adaptive optics to image the central region, to- 
gether with a survey space telescope to give the large-scale 
map. We will explore the consequences of this strategy in 

S3I 



3 METHODOLOGY 

3.1 Estimating reduction in lensing scatter 

Consider a circular field of radius O containing galaxies with 
a redshift distribution p ga \(z). Suppose we perform a KS-like 
inversion on shear and flexion maps of the field after smooth- 
ing these with a radially symmetric filter with scale-radius 8 
(filter can be a top-hat, Gaussian, etc.). The reconstructed 
convergence at sky position x is 



,(x) = K fl (x)-ii-(e) + i/ e (x;e) 



(5) 



Here, ng is the true convergence of our galaxies smoothed by 
our filter, vg is filtered shape noise, and K (O) is a constant 
determined by fluctuations larger than the field area. The 
mass-sheet degeneracy prevents us from determining this 
constant from shape measurements alone (see appendix |A")) . 
A simple theoretical estimate of K is the true convergence 
field at the center of our map, smoothed on the scale of our 
map: 



A-(e) « K ©(x ) 



(6) 



Note that the noise at x depends on all galaxies in the survey 
area, not just those within the smoothing scale, because of 
the non-local properties of the KS inversion. 

Let there be a SMBHB with redshift zbhb located at 
xbhb, and let kbhb be the true convergence along the line 
of sight. Our uncertainty in the true convergence, ct(kbhb), 
is initially (i.e. with no lensing map) given by 



c(kbhb) = ( k Ihb) 



1/2 



(7) 



where angular brackets denote an ensemble average. Once 
we map the convergence field, we replace our uncertainty 
with a smaller one, a 1 (kbhb), as the lensing map allows us 
to remove some of the true convergence, leaving a residual 
due to the noise, smoothing, and broad redsh ift range of the 
lensing map. Following D03 and Bower (|l99ll ). the expected 
improvement is given by 



ct'(kbhb) 2 = (1 - »" 2 )o-(kbhb) 



(8) 
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(KmapKBHB) 2 
( K BHb) ( K map) 



(kbkbhb — K9Kbhb)' 



bhb) 



ti-4 + "«(©) a ) 



(9) 



where dependence on xbhb has been omitted. Thus r 2 can 
range from zero to unity with r 2 — 1 when our map perfectly 
reconstructs the convergence at xbhb- We are assuming that 
the lensing fields are Gaussian random; hence we have as- 
sumed that fluctuations within the map don't correlate with 
those larger than the survey area: (Kmapie) = 0. We also ig- 
nore intrinsic alignments which would correlate lensing with 
shap e noise jC rittcndcn et alj 1200 fl ; iHevmans fc Heavens! 
120031 ; iHirata fc Seliakl |2004| ). and we igno re higher order 
lensing effects such as reduced shear /flex ion (|Dodelson et al.l 
l200d : IWhitc 200l; ISchneider fc Er|[2008h . These are simpli- 
fications that should be revisited in future work. 

The various correlations "(kk)" in ([9]) can be calculated 
using the following general expression: 



2tt 



{ Ki K j ) = — / dnp K (i-i,j)m)F 3 (i) 



(10) 



for i,j £ {6,0, BHB}. The multipole / is the Fourier con- 
jugate to x, and we are working with the flat-sky and Lim- 
ber approximations. P K (l;i,j) is a generalized convergence 
power spectrum, defined below. Fi is the Fourier trans- 
form of the appropriate filter function for m: if we assume 
Gaussian-smoothed shape maps and a top-hat-shaped sur- 
vey, then 



Foil) 



Fe(0 = ^|^, ^bhb(I) = 1- (11) 



The generalized convergence power spectrum is given by an 
integral over comoving distance 



Wi( X ) 
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(12) 
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where Ps (k; x) is the 3D matter power spectrum for k — l/x 
at a distance Xi accounting for the growth of structure. The 
scale factor is a = (1 + z) , and xh is the distance to 
the horizon. We have assumed a flat Universe for simplicity. 
Here, pt(z) is the redshift distribution of the sources: 



pe(z) = p&(z) = Pgal(z), Pbhb = S D (z 



zbhb J 



(14) 



where the Dirac delta function, 8n, should not be confused 
with the matter overdensity, S. Using equations (|f 0[1 - (|14[) . 
we can calculate r 2 from ©; we only need (ve{®) 2 ^), the 
variance at a point in our convergence map due to noise. 



3.2 Shape Noise in a KS Inversion with Flexion 

Assuming that our noise is dominated by Poisson noise from 
intrinsic galaxy shapes, the noise power spectra are 
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(15) 
(16) 
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where n ga i is the projected number density of galaxies and 
<r 7 , ap, or ag is the RMS shape for a single galaxy. If 
we reconstruct the convergence from a minimum-variance- 
weighted linear combination of the shear and flexion (as in 
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Figure 1. Variance due to shape noise for a k map reconstructed 
from shape maps smoothed by a Gaussian filter of width 8. The 
dashed line assumes shear only while the solid line adds both 
types of flexion. The RMS shapes are those expected for an 
Extremely Large Telescope: <r 7 = 0.2, ajr = 0.5/arcmin, erg = 
0.9 /arcmin 



lOkura et J] 12007? ) . then the noise power spectrum of our 
convergence map will be 

( i i 2 i 2 V 1 

PT*\l) = — L- + -L- + -i^- . (18) 

v ' I pnoise Dnoisc Dnoisc / v > 

The different ^-dependencies of shear and flexion noise are 
explained in Appendix [B] For shape maps smoothed by a 
gaussian filter, the shape noise contribution to ^K^ap) is 

1 f°° 

Cp{9) = i J dll MlfPr isc (l) (19) 

where the lower limit of integration, / = 0, implies a very 
large map (we will consider small maps presently). Substi- 
tuting (|18[) into (|19[) and defining 

—2 —2 —2 

°>s = °> + a g 
we find 
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where F(s,x) is the incomplete gamma function 



T(s,x) 



In the shear-only limit, org — > oo, the property 
lim r(s, x)e x x 1 ~ s — 1 



(22) 



(23) 



(24) 



leads to C v (8) ~ <r 2 /(7rfl 2 n ga i) , as expected for a KS in- 
version (jSeitz fc Schneidedl 19951 ) . Decreasing the smoothing 
scale, 9 — > 0, we see that small-scale noise leads to a di- 
vergence in the shear-only limit; hence we must smooth to 
obtain finite noise. In the flexion-only limit, <j 7 — » oo, C v {9) 
diverges due to large-scale noise, which can be seen by ig- 
noring the shear term in l|2ip . For finite maps, there are no 
noise contributions from modes larger than the map, so 



<^(x;9) 2 ) =C P (8)-C P (0) 



(25) 
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Although we are considering a tophat-shaped map, and the 
excluded noise C p (0) assumes a large Gaussian filter, we 
note that for large enough 0, we recover the shear-only limit, 
which is the same for either filter function. 

When shear and flexion are combined, they attenuate 
shape noise in a complementary way. We can see in (|21|1 that 
the shear term prevents a divergence toward small I while the 
flexion term reduces the divergence toward large I. Without 
the smoothing factor, there is an ultraviolet divergence even 
with flexion, but it is less severe than for shear. The benefit 
of a combined approach is illustrated in Figure [1] which 
compares shape noise from a shear-only inversion to one that 
includes flexion. Since flexion suppresses small-scale noise, 
we can smooth our shape maps on smaller angular scales 
without significantly increasing the noise in our convergence 
map. Thus, our convergence map will recover more small 
fluctuations, leading to a larger (kskbhb) and therefore a 
larger r 2 by @. Of course, this improvement degrades as 
we increase and erg. 



4 RESULTS 

For all calculations, we adopt a flat, ACDM model with h = 
0.7, fi m = 0.27, Q b = 0.045, <r 8 = 0.8 and n 3 = 0.96, con- 
sisten t with the WMAP 5-year parameters (|Dunklev et al.l 
120081 ). We calculate the linear matter power spectrum using 
the fitting formula of Eisenstein & Hu (jl999T ) without baryon 
wiggles, and we appl y to this the nonlinear fitting formula 
of Smith et al. (|2003l ). The weak lensing fields are assumed 
to be Gaussian and uncorrelated on different scales. For our 
cosmological model, we expect that without delensing, the 
uncertainty in the lensing of a SMBHB, <t(k) = a(/j,)/2, will 
be 3.9% for zbhb = 2 and 5.2% for zbhb = 3. By ([T]) this un- 
certainty is equal to the relative distance error, u{Dl) / Dl- 



4.1 Distance error reduction from a deep image 

We first consider delensing with a narrow, deep image of 
a similar size and depth to the Hubble Ultra Deep Field 
(HUDF). The HUDF contains over 8000 galaxies detecte d 
at over IOct within an area of 12 arcmin 2 (|Coe et al.ll2006h . 
We take our galaxy redshift distribution to be 



/ \ a -(z/zO) 13 

p ga i(z) oc z e w 



(26) 



with a = 0.8, j3 = 2.0 and z me d = 1.8 in order to approx- 
imate the redshift distribution of the HUDF. However, we 
suppose that we have the resolution provided by adaptive 
optics with an Extremely Large Te lescope (ELT) of 50mas 
|ELT Science Working Group||2006l ). We estimate that such 
an instrument will provide RMS galaxy shapes of 

0.5 0.9 



= 0.2, a T = 



ag = 



(27) 



where the covarianc e between these measures is negligible 
l|Massev et al.l [20071 . and Rowe et al. 2009 in preparation). 
With an ELT, we can consider going deeper than the HUDF 
so as to obtain a denser galaxy field. However, since galax- 
ies are roughly 1 arcsec 2 in size, we will become confusion- 
limited as we approach n ga i ~ 1 /arcsec 2 = 3600/arcmin 2 . 

As D03 point out, there is generally an optimal smooth- 
ing angle for a shear map. If the smoothing is too fine, the 



map will be very noisy; if smoothing is too coarse, the small- 
scale magnification fluctuations will be washed away. When 
adequate flexion measurements are included with the shears, 
noise can be reduced enough to make the optimal smooth- 
ing scale smaller than the inter-galaxy spacing. Of course, it 
makes no sense to smooth on scales smaller than the inter- 
galaxy spacing; we therefore choose a conservative cutoff of 



n ga i7r6> 



10 



(28) 



For example, for a galaxy density of n ga i = 800/arcmin 2 , 
we would use = 0.063 arcmin. Figure [T] shows that at this 
scale, including flexion reduces the variance in the noise by 
more than an order of magnitude relative to shear alone. 

We define the distance error remainder, 1Z, to be the 
SMBHB distance error after delensing divided by the origi- 
nal distance error: 



(29) 



Since o(Dl) oc ct(k), the equation for 1Z follows directly 
from ©: 



TZ= VI - r< 



(30) 



Figure [2] contains contour plots showing the 1Z we could ob- 
tain from shape maps of a given size and galaxy density. 
The solid contours assume a HUDF-like galaxy redshift dis- 
tribution, given by (|26|l . while the dashed contours assume 
that all galaxies lie in a plane at the redshift of the SMBHB. 
The latter configuration is clearly a best-case scenario and 
provides an upper limit on how well a map can measure the 
SMBHB magnification. 

As an example, consider the plot with zbhb = 2 in 
Figure [2] Going only by the solid contours, the "HUDF" 
label lies near 1Z = 0.7. Therefore we could use an HUDF- 
like ELT image to reduce the SMBHB distance uncertainty 
to about 70% of its original value (i.e. we reduce the er- 
ror bar by 30%); this assumes that we use no redshift in- 
formation about individual galaxies and do not attempt to 
break the mass-sheet degeneracy. In this case, the remaining 
uncertainty comes primarily from convergence fluctuations 
on scales larger than our map. While we can improve our 
measurement of the SMBHB magnification by widening our 
map, there are diminishing returns on making it deeper. 
With substantial access to an ELT we could create a mosaic 
to reduce our distance error to less than 60% of the original 
value. 

We reiterate that the solid contours in Figure [2] as- 
sume that we only construct 2D weak lensing maps. If we 
can measure the redshifts of individual galaxies and down- 
weight galaxies as they deviate from zbhb, we could im- 
prove on these results. The zbhb = 3 plot highlights the 
importance of the galaxy redshift distribution. Because the 
galaxies have z mcd = 1.8, most of them are closer to us 
than the SMBHB and are therefore lensed by different struc- 
tures. Hence, there is a large discrepancy between the solid 
(realistic) and dashed (idealized) contours. Meanwhile, the 
galaxies are almost evenly distributed around a SMBHB at 
zbhb = 2, so there is a smaller difference between the con- 
tour sets. 
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Figure 2. Contours of TZ, the fraction of distance error remaining 
after measuring a SMBHB's magnification with a narrow, deep 
image. Solid lines assume the galaxy redshift distribution given by 
(126 1 . Dashed lines assume that all galaxies have the same redshift 
as the SMBHB. The point marked "HUDF" is for an ELT image 
similar in size and depth to the Hubble Ultra Deep Field. The 
point marked "ELT" assumes a large tiled image from an ELT. 



4.2 Distance error reduction from hybrid maps 

It is apparent from Figure [2] that we could significantly 
improve our convergence reconstruction with wider images 
around each SMBHB. Unfortunately, it would be impracti- 
cal to use telescopes such as an ELT to create mosaics larger 
than a few tens of sq uare arcminu tes. A spa ce survey tele- 
scope such as JDEM (|NASAl2009f ) or Euclid (|Laureiis et all 
l200ct ) would provide the required width but it would be blind 
to sub-arcminute convergence fluctuations that require high 
galaxy densities. We therefore propose making convergence 
maps from a hybrid of deep, pointed observations and wider 
survey images. The idea here is that the deep images will be 
used to measure small-angle convergence modes, while the 
wide images will pick up modes larger than the size of the 
deep images. 



Accordingly, we relabel 



to /tdecp, and we let «; w id 



denote the convergence map obtained from the larger survey. 
We are unconcerned with small fluctuations outside of the 
deep map. Therefore to minimize shape noise in the wide 
map, we smooth it by a top-hat filter of radius O, the size 
of the deep map: 



Kdeop(x) = Ks(x) - Ke(x) + V 6 (x; 9) 

Kwidc(x) = 4( x ) + ^e( x ) ■ 



(31) 
(32) 



Because each of these terms depends on the galaxy density 
and redshift distributions of the corresponding images, we 
have introduced the * superscripts to signify a dependance 



Figure 3. Solid contours are the same as in Figure [2] Dashed 
contours show the improvement when a shear map from a survey 
space telescope is used in tandem with an ELT. 



on the n g ai and p ga i(z) of the wide survey. For a sufficiently 
wide survey, the shape noise at x is independent of the sur- 
vey area, so we make that approximation here. 

We can split kbhb into terms comprised of modes with 
wavelengths that are smaller or larger than the narrow, deep 
image area: 



kbhb = ks + kl 



(33) 



The subscripts denote the "small" and "large" terms. As 
in we estimate the large term to be equal to the total 
convergence smoothed on the scale of the deep map: 

kl ~ kbhb ; 6 (34) 
We now define "small" and "large" versions of r 2 : 



2 (KdeepKs) 



<«§><«L, P > <«§><*S-«I + 



2 (^widc^L) 

T = 



(35) 



(36) 



KX^ido) K>(«& 2 + ^ 2 }' 

Our final uncertainty in kbhb is now given by 

O-'(KBHB) 2 = (1 - r-;L cp ) + (1 - 7V,id c ) • (37) 

We take our space-based survey to have n ga i = 100 and 



a redshift distrib ution given by H26II with a = 2, [3 = 1.5 
and z mcd = 1.5 (|Albrecht et alj|2006h . We take the intrin- 



sic shear to be a~, 



0.25, but we do not consider flexion 



measurements from the wide survey. Flexion measurements 
will have larger errors (~ 1 per arcmin) with a survey in- 
strument, and since flexion's sensitivity to matter fluctua- 
tions decreases with angular size, it should only marginally 
improve a shear measurement of ■ Figure [3] shows how 
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Figure 4. Uncertainty in SMBHB distance versus SMBHB red- 
shift, with and without delensing. Solid: distance uncertainty 
without delensing. Dotted: reduction expected after shear-only 
delensing with a 30 arcmin 2 image from an ELT (n ga i = 
1000/arcmin 2 , 2 mcd = 1.8). Dashed: shear and flexion delens- 
ing with an ELT. Dot-dashed: shear and flexion delensing with 
an ELT combined with a space survey telescope. 2D shear and 
flexion maps are assumed (no individual redshift measurements). 
Note that this ELT image is smaller than the one marked in Fig- 
ures [2] and [3] 



survey. The degeneracy could also be broken by including 
magnification maps from quasar number counts. Further- 
more, with the advent of an ELT, improved high-resolution 
shape measurement techniques may emerge, leading to maps 
with significantly lower shape noise. 

We have made several simplifying assumptions that 
should be addressed in future work. We have extended the 
Smith et al. fitting formula for the matter power spectrum 
beyond its range of accuracy, and we have assumed that 
matter fluctuations remain Gaussian random. A more com- 
plete treatment of the matter power spectrum, e.g. from the 
results of N-body simulations, will be needed to refine the 
technique of delensing and estimate its potential success (cf. 
Gair, King, et al. in preparation). It may be possible to ex- 
ploit the fact that non-linear gravitational clustering causes 
the true weak lensing fields to contain l ess information than 
their Gaussian random approximations ([Coorav fc Hu|[200ll : 
iRimes fc Hamiltonll2005l ; iDore et al.ll2009l ). Finally, we have 
ignored higher order weak lensing effects such as reduced 
shear/flexion and contaminating effects such as intrinsic 
alignments. 



including a wide map improves on a deep map's ability to 
delens a SMBHB. For z B hb = 2, delensing with a HUDF- 
like image by itself can reduce the distance error by about 
30%, but when combined with the space survey, half of the 
uncertainty can be removed. The improvement is slightly 
less for zbhb = 3; we should expect this since most galaxies 
in the space survey have a redshift lower than 3. Notice now 
that creating a large ELT mosaic provides a little improve- 
ment but will not be as effective as making the image deeper. 
Again, these results assume that we use 2D maps and do not 
attempt to break the mass-sheet degeneracy with informa- 
tion beyond shape measurements. 



5 CONCLUSIONS AND FUTURE WORK 

Gravitational lensing of supermassive black hole binary sys- 
tems (SMBHBs) severely limits their usefulness as stan- 
dard sirens - precise distance probes based on known grav- 
itational wave signals. We suggest that "delensing" each 
SMBHB - i.e. mapping the magnification field around it 
using a Kaiser-Squires-like inversion technique - could sub- 
stantially reduce distance errors due to lensing. Figure [4] 
illustrates the error reduction that could be achieved with a 
30 arcmin 2 image from an Extremely Large Telescope com- 
bined with a space-based survey such as JDEM or Euclid. 
We find that combining 2D maps from these instruments 
could reduce SMBHB distance error from lensing by about 
a factor of 2 for zbhb > 1-5. The success of SMBHB delens- 
ing requires adequate flexion measurements to recover fine 
fluctuations in the magnification field and shear measure- 
ments to recover larger features. 

It may be possible to achieve superior results by using 
techniques beyond the simple Kaiser-Squires inversion con- 
sidered here. For instance, incorporating a redshift for each 
galaxy would enable some improvement over 2D maps. Red- 
shifts could also help break the mass-sheet degeneracy in a 
deep image, thereby eliminating some of the need for a wide 
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APPENDIX A: EFFECT OF THE MASS-SHEET 
DEGENERACY ON DELENSING 

Let Kg be the true convergence of our galaxy sample 
smoothed by a filter with a characteristic angular scale 0. 
Ignoring shape noise, our convergence map, K m a P , is related 
to the true convergence by 



(1 - 0)Ke + P 



(Al) 



for some unknown constant p. This is the mass-sheet degen- 
eracy: shape measurements cannot distinguish the scaling 
factor (1 — p) from a constant sheet of mass with conver- 
gence p. Let us think of Kg as being composed of two terms: 



Kg = Kg s + Kg L 



(A2) 



where Kg s (or Kg L ) is a sum over Fourier modes with wave- 
lengths smaller (larger) than the scale of the map. Note that 
if we restrict our attention to the area of the map, we can 
treat Kg L as a constant, which we call K(Q) for a circular 
map of radius O. 

If we apply an initial transformation (|A1[) to K map to 



obtain a new map with 



3 ) =0, then this new map 



should be proportional to Kg$ and still related to Kg by 
(lAlll . Therefore, since K map = when Kg$ — 0, we have 



■ p = -p/K(@) = [1- A-(e)]- 1 . 



We can now write 

Kgs Kg 



K(0) 



K(Q) 1 - K(@) 



(A3) 



(A4) 
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Thus, our map basically allows us to reconstruct ng s up to 
some multiplicative constant which is determined by fluctu- 
ations larger than the map. For weak distortions (k <C 1), 
we can ignore this factor to first order: 



Kiiisp ~ n$s[l + K(Q)] « KOS = ko — K(&) 



(A5) 



So to first order, our map determines the total convergence 
field modulo some unknown additive constant, K(0). De- 
lensing reduces our uncertainty in ngg but not K(Q), which 
cannot be determined from shape measurements within the 
map. Measuring K(Q) requires that we either widen the map 
or else break the mass-sheet degeneracy using additional in- 
formation in the field (e.g. redshifts). 

The higher order corrections can be significant when 
calculating 2-point statistics. For instance, at second order, 
the term (re map KBHB) in @ becomes 



(^map^BHB } ~ > {^map KBHB ) + (tmapKBEBKe) 



(A6) 



The 3rd order term vanishes for purely Gaussian fields, but 
in reality such corrections can be as much as 10%, depending 
on the amplitu de of the non-linear matter power spectrum 
on small scales (|Whitell2005l ; |Shapirdl20Qgh . We expect the 
particular correction above to be partially suppressed due 
to the low correlation between «: map and Ke, which repre- 
sent different angular scales. Such non-Gaussian/non-linear 
effects will need to be accounted for in a realistic delens- 
ing scheme, but they are beyond the accuracy that we are 
interested in for this paper. 



APPENDIX B: SHAPE NOISE IN A 
KAISER-SQUIRES INVERSION WITH SHEAR 
AND FLEXION 

A Kaiser-Squires-like inversion of shape measurements to re- 
construct the convergence field involves the following steps: 

(i) Measure shear and flexion (7, T , Q) for each galaxy 

(ii) Smooth each shape map with a filter function Fe(x) 

(iii) Fourier transform each smoothed map 

(iv) Convert shear and flexion to convergence in Fourier 
space 

(v) Calculate convergence modes using minimum vari- 
ance weighting 

(vi) Inverse Fourier transform to obtain K map (x). 

Because shear and flexion are converted to convergence dif- 
shape noise in the resulting k map de- 



ferently in step (iv 



pends differently on intrinsic shears and flexions. We now 
briefly review the conversion equations. 

We start by introducing the complex notation for shear 
and flexion: 

7(0 = 7i(0 + *72(«) (Bl) 
^(i) = Fi (Z)+iF 2 (Z) (B2) 
0(1) = 0i(I) +*&(*) (B3) 

Next, borrowing notation from iBartelmann fc Schneider] 
(|200ll ). we define the following complex kernels in Fourier 
space: 



(B4) 



f>g(l) = ^(Jl+iJa) 



(B6) 



where I = \l\ = l\ + 1%. Note that the subscripts on I denote 
the x- and y-components of the wave-vector I relative to 
some fixed i-axis, while subscripts on shear or flexion denote 
polarizations defined by the same a:-axis. Hereafter, we will 
not explicitly write the dependence of quantities on I. 

In Fourier space, for I 7^ 0, weak lensing distor tions are 
related to the convergence by[j l|Bacon et al.ll2006h 



7 = ty 1 kT>-y 







7r 1 kT>j^l 2 



We can therefore use the fact that 

X>*X> 7 = ty 2 

and 

V* T V F = V* g Vg = Tv 2 r 2 



(B7) 
(B8) 
(B9) 

(BIO) 

(Bll) 



to convert shape modes to convergence modes using the fol- 
lowing equations: 



1 map 



= TY 



£map = 7Y UqQ 



(B12) 
(B13) 
(B14) 



The convergence power spectrum of our k map is de- 
fined by 



(« m ap(«)«map(0*) = {2n) 2 8{l -!K. P (!) 



(B15) 



with similar definitions for the shear and flexion maps. Us- 
ing (|B12[) . (|B13I) and (|B14|) . it is easy to show that a con- 
vergence power spectrum obtained from inverting the white 
noise spectrum of intrinsic shear or flexion will contain a 
corresponding factor of T>T>* . The result is 



r ,sc (i) = 



Pg 



(Bie) 

(B17) 
(B18) 



If we compute each k from a linear combination of shear and 
flexion using minimum variance weighting, the final noise 
spectrum is given by 



P« noise (0 



+ 



I 2 



pnoisc 



+ 



Dnoisc 



(B19) 
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